Bayesian Variable Selection in Regression with Networked Predictors
نویسندگان
چکیده
We consider Bayesian variable selection in linear regression when the relationships among a possibly large number of predictors are described by a network given a priori. A class of motivating examples is to predict some clinical outcomes with high-dimensional gene expression profiles and a gene network, for which it is assumed that the genes neighboring to each other in the network are more likely to participate together in relevant biological processes and thus more likely to be simultaneously included in (or excluded from) the regression model. To account for spatial correlations induced by a predictor network, rather than using an independent (and identical) prior distribution for each predictor’s being included in the model as implemented in the standard approach of stochastic search variable selection (SSVS), we propose a Gaussian Markov random field (MRF) and a binary MRF as priors. We evaluate and compare the performance of the new methods against the standard SSVS using both simulated and real data.
منابع مشابه
Bayesian forecasting with highly correlated predictors
This paper considers Bayesian variable selection in regressions with a large number of possibly highly correlated macroeconomic predictors. I show that by acknowledging the correlation structure in the predictors can improve forecasts over existing popular Bayesian variable selection algorithms.
متن کاملBayesian variable selection in quantile regression
In many applications, interest focuses on assessing relationships between predictors and the quantiles of the distribution of a continuous response. For example, in epidemiology studies, cutoffs to define premature delivery have been based on the 10th percentile of the distribution for gestational age at delivery. Using quantile regression, one can assess how this percentile varies with predict...
متن کاملBayesian Variable Selection with Related Predictors
In data sets with many predictors, algorithms for identifying a good subset of predic-tors are often used. Most such algorithms do not account for any relationships between predictors. For example, stepwise regression might select a model containing an interaction AB but neither main eeect A or B. This paper develops mathematicalrepresentations of this and other relations between predictors, wh...
متن کاملBayesian variable selection and the Swendsen-Wang algorithm
The need to explore model uncertainty in linear regression models with many predictors has motivated improvements in Markov chain Monte Carlo sampling algorithms for Bayesian variable selection. Traditional sampling algorithms for Bayesian variable selection may perform poorly when there are severe multicollinearities amongst the predictors. In this paper we describe a new sampling method based...
متن کاملBayesian partial factor regression
A Bayesian linear regression model is developed that cleanly addresses a long-recognized and fundamental difficulty of factor analytic regression – the response variable could be closely associated with the least important principal component. The model possesses inherent robustness to the choice of the number of factors and provides a natural framework for variable selection of highly correlat...
متن کامل